# Linear Regression-Based Power Analysis for Digital Electronic Systems

Y. A. Durrani<sup>1</sup>

<sup>1</sup>Electronic Engineering Department, UET, Taxila, Pakistan <sup>1</sup>yaseer.durrani@uettaxila.edu.pk

Abstract-This paper presents the linear regressionbased power estimation technique uses the input patterns with the predefined characteristics. These patterns help to analyze the power consumption of the different intellectual-property (IP) cores in digital electronic systems. Our technique can accurately deals with combinational as well as sequential logic circuits at register-transfer level (RTL). Genetic algorithm (GA) is used for the generation of sequences with the statistical characteristics. During the power analysis procedure, the Monte Carlo simulation is performed and model function is extracted with the help of Look-Up-Table (LUT) approach. This function uses IPs primary inputs values for the analysis of power consumption. Our model demonstrates accurate and fast power estimation for IP-based digital system.

*Keywords*-Digital Systems, Power Estimation, Macro-Model, Input Metrics.

# I. INTRODUCTION

Low-power consumption is nowadays very important design goal in very-large-scale integrated (VLSI) circuits. A key objective in low power-based system is to analyze power accurately. Power estimation at high design abstraction level, such as electronics and software engineering, is called for to facilitate new solutions to efficient power problems. Hence, a design and estimation approach for lowpower is the key challenge to a successful System-onchip (SoC) design. The rapid growth of transistor density and the operational frequency in digital electronic circuits have made power an important design constraint. Power optimization and estimation of SoC has nowadays become a difficult task for which the conventional approaches often prove to be inadequate. In order to handle the power, electronic design automation (EDA) tools have been introduced. These tools are very helpful for the minimizing of power dissipation in digital devices. The accurate power estimation EDA tools are needed from high to low abstraction levels.

In literature, there are several power analysis techniques for estimating the power consumption at different abstraction levels. Power estimation can be categorized in dynamic or static approaches at low level. Dynamic approaches are efficient and explicitly simulate the circuit with a typical input signal sequences. Statistical approaches such as Monte Carlo zero delay simulation [i] that helps the problem of input pattern dependencies. However, this technique assumes the transition activities and signal probabilities are independent and may therefore give inaccurate results. On the other side, static approaches do not simulate the design. Instead, they are dependent on the static knowledge (like correlations, switchingactivities and signal-probabilities) about the input signal and estimate the same statistical properties for the internal circuit to measure the average power dissipation of the circuit. The input-dependence problem is faced by using selected statistical characteristics of the input patterns. These approaches are accurate and fast which can be obtained from large simulation results.

It is observed that the highest power optimization can be achieved at higher-level of the design abstractions. High level approaches can be roughly split into two types: bottom-to-up and top-to-down. The bottom-up approaches [ii-iv] are more accurate with the reused IP modules. In those modules, the internal details of the design are given and the power macro-model can be constructed easily with the power characterizations that are helpful in low to high level simulation estimates. In top-to-down techniques [v-vii] the design is specified without the technical information on the design implementation. Therefore these techniques are only helpful when the design was not previously implemented. However, they may not have accurate power analysis due to the insufficient information of the design implementation details.

RTL approaches [viii-ix] have been proposed to measure the switching activities of the design. These approaches are accurate but relatively slower and expensive, which limits the length of input streams. Power consumption in digit electronic systems are input pattern dependent. To measure accurate power dissipation, set characteristics of input samples are needed. Usually these input samples have large sample size. If samples are selected arbitrarily, may not be able to measure the expected behavior of the power dissipation of the circuit. To handle this important problem, vector compaction approach was introduced in [x]. This approach was used in input patterns into a Stimulus sequence which has smaller size. These approaches are more accurate but they can not efficiently considered factors like propagation delay, glitch-generation etc. RTL power macro-modeling technique was developed with the variety of equations and Look-Up-Table based approaches. Among those, building power macro-model for the digital system was popular technique for analysis of the average power dissipation. In the literature survey, most power model increases in complexity to capture the accuracy requirements. To reduce the simulation time, input sample compaction technique was introduced that captures the exact power behavior of the circuit.

In this work, we used our research in [xi-xiv] developing for the low-power modeling approaches in the power estimation method for more complex digital test system. Our proposed power model in this paper is the linear regression-based LUT approach at RTL. We use genetic algorithm to investigate the statistical properties of the input sample streams that influences power consumption of digital system. The average input statistical characteristics of our model are the transition-density 'TD', signal-probability 'SP', spatial-correlation 'SC' and temporal-correlation 'TC'. We use IP-based macro-modules for our experiments and achieve comparatively good accuracy.

The paper is organized as follows: Initially we give the brief problem statement and we discuss the introductory background of our linear regression-based power analysis method. Then we explore our GA algorithm and the model is evaluated in experimental results. Finally, we summarizes our work.

#### **II. PROBLEM STATEMENT**

The problem solved in this paper is to find a statistical method which accurately and fast estimates the power consumption in any digital system by the addition of its each IP-module. This method makes the power analysis of digital system an easy task that helps for the analysis of power features at higher level.

## III. LINEAR REGRESSION-BASED POWER ANALYSIS

The flow of our high-level power macro-modeling method is shown in figure 1. Our approach consists of the following steps:

- 1) Characterization of each IP module at high-level design library by simulating it under pseudo random signals and fit statistical variables regression curve to the power dissipation results using a least-mean-square (LMS) error fit.
- Extractions of the power function from parameter model of IPs with using Monte Carlo simulation approach. The high-level simulator is used to collect power values for various IP modules in the SoC System.

 Evaluation of the power macro-model function at high-level IP design which are found in the library by plugging the parameter values in the corresponding macro-model function.

The proposed model is linear regression-based LUT approach. To obtain power macro-model, a linear function is used in (1) that estimates power of the given input vector streams.

$$P_{IP_{avg}} = \beta_0 + \beta_1 \cdot \alpha_1 + \beta_2 \cdot \alpha_2 + \dots + \beta_{n-1} \cdot \alpha_{n-1} + \beta_n.$$
  
$$\alpha_n + \varepsilon \tag{1}$$

Where  $P_{IP\_avg}$  is the average power dissipation of the individual IP core,  $\beta_0, \beta_1, \dots, \beta_{n-1}, \beta_n$  are the regression coefficients obtained from the regression analysis,  $\alpha_1, \alpha_2, \dots, \alpha_{n-1}, \alpha_n$  are the statistical characteristics of each input and  $\epsilon$  is the error. The parameters of the regression are determined using the linear regression by finding the least-square fit. Equation (1) can be expressed in (2):

$$P_{IP_{avg}} = \beta_0 + \beta_1 \cdot TD + \beta_2 \cdot SP + \beta_3 \cdot SC + \beta_4 \cdot TC + \varepsilon$$
(2)

where *TD*, *SP*, *SC*, *TC* are the statistical characteristics of our model.

The regression equation in (2) can be computed by applying the set defined input pattern values of *TD*, *SP*, *SC*, and *TC*. The determination coefficient  $p^2$  is measured to improve the quality of (2). It determines the proportionality of data set patterns of the input characteristics that helps to predict the accurate power.  $p^2$  varies from 0 to 1 and it is defined in (3):

$$p^2 = 1 - \frac{\epsilon_s}{r} \tag{3}$$

Where  $\epsilon_s$  and r is defined in (4) and (5):

$$\epsilon_s = \sum (x_i - y_i)^2 \tag{4}$$

$$r = \sum (x_i - \bar{x})^2 \tag{5}$$

With  $\bar{x}$  is the mean of the estimated data,  $x_i$  predicts the values and  $y_i$  is the data set value.

The proposed signal generation algorithm generates input streams of different values with the characterizations of *TD*, *SP*, *SC*, and *TC*. These values with different combination stored in LUT that allows the power analysis with minimum error. It measures the average power dissipation  $P_{IP}$  Module in (6).



Fig. 1. Power estimation methodology.

$$P_{IP\_Module} = f(TD, SP, SC, TC)$$
(6)

The f(.) is a mapping-method to be performed during the characterization of the input characteristics. To find function f(.), we must sample a set number of streams with several *TD*, *SP*, *SC*, and *TC* values. The power dissipation of IP modules-based system  $P_{IP\_Module}$ is obtained in (7) using (6).

$$P_{IP\_System} = \sum_{i=1}^{n} P_{IPi\_Module}$$
(7)

In this technique, the pattern of input singles are generated according to the input statistical properties (TD, SP, SC, and TC) and power dissipation  $P_{IP Module}$  is predicted in (6). Then Monte-Carlo zero delay method [viii] is used with several input streams of their statistical knowledge and evaluated the accuracy of  $P_{IP Module}$ . The accuracy of the model on each module requires information of the statistical signals. While maintaining accuracy to obtain this knowledge, several simulations are performed with various values of TD, SP, SC, and TC of each IP module. In Fig. 2, the inputs of the module *IP-M* are the inputs of the IP-based digital system and outputs of *IP-M* are inputs of *IP-M*<sub>1a</sub>,  $IP-M_{1b}$ ,  $IP-M_{1c}$ , and  $IP-M_{1d}$ , and continues till the final outputs. The output statistical characteristic values for each IP module can be used as input signal statistical values of the related connected IP modules. For IP-M module, we generate random signals of several different patterns of TD, SP, SC, and TC. Then to build LUT, we performed several simulations in our test system. For each module different values of average input statistical properties are predicted by using large simulations. The different values of the system's power dissipation are determined using power simulator. The simulated power is compared by using HSPICE simulator with estimated power model  $P_{IP\_System}$  in (7).

### **IV. ALGORITHM EXPLORATION**

Genetic algorithm generates randomly input sequences of *TD*, *SP*, *SC*, and *TC* for the power estimation of IP-based test system. Finding the optimal solution that satisfies the convergence criteria of the power waveforms during simulations of the system is the main purpose of power estimation. Our GA procedure is as follows:

- 1. Generate an initial population  $P_i$ , and then set  $p_{cross}$  and  $p_{mut}$  which are crossover and mutation probabilities respectively. Where  $p_{cross} \in (0,1)$  and  $p_{mut} \in (0,1)$ . Set the generation counter with *counter*  $t \coloneqq 1$ .
- 2. Evaluate the fitness function value F for all possible chromosomes in  $P_{chrom}$  with the selection of an intermediate population  $P'_{chrom}$  from the possible current population  $P_{chrom}$ .
- 3. Generation of random number from (0,1) with every chromosome in *P*'<sub>chrom</sub>, then add to evolve each chromosome to the parent set *PS*<sub>chrom</sub>, if



Fig. 2. IP-based test system

related number is less than  $p_{cross}$ . Following steps must repeat until all parents in  $PS_{chrom}$  are mutually mated: (a). Select two parents  $p_a$  and  $p_b$ from  $PS_{chrom}$ . Mate  $p_a$  and  $p_b$  to regenerate children  $c_a$  and  $c_b$ . (b). The children pool set

 $PS_{child}$  updating through  $PS_{child} \coloneqq PS_{child}$  $\mapsto PS_{child} = PS_{child}$ 

$$\cup \{c_a, c_b\} \text{ and } r S_{chrom} \leftarrow r S_{chrom} - \{p_a, p_b\}$$

- 4. Associated random number generation from (0,1) for all possible gene in every chromosome in  $P'_{chrom}$ . Then mutation of this gene is performed, if the associated number is less than  $p_{mut}$  and further add the mutated chromosome only with the children pool set  $PS_{child}$ .
- 5. Finally, if the stopping criteria is satisfactory, then terminate the population generation. Else the selection for the next generation  $P_{chrom+1}$  from

 $P_{chrom} \cup PS_{child}$ . Set  $PS_{child}$ : = {i...i}, set

*counter*\_ $t \coloneqq counter_{t} + 1$ , and go to the step 2.

#### V. EXPERIMENTAL RESULTS

In experimental results, we discuss our accurate technique for IP-based test system is shown in Fig. 2. We have developed this technique and build the power macro-model at architectural abstraction level. Accuracy of genetic algorithm is implemented on IPbased macro-modules. For these modules, random input pattern is generated for several TD, SP, SC, and TC values. The input pattern generation of our new technique is highly auto-correlated. The power consumption is measured by using Monte Carlo zero delay method. Several values are predicted by LUT based approach and these power values are compared with simulation results. Finally, we performed different error analysis to compare our macro-model results with the commercial tool. The maximum, minimum, average and root mean square (rms) errors are computed using (8) and (9).

$$\varepsilon_{\rm rms} = \frac{1}{n} \sum_{i=0}^{n} \sqrt{\left(\frac{P_{\rm sim_i} - P_{\rm est_i}}{P_{\rm est_i}}\right)^2} \tag{8}$$

$$\boldsymbol{\varepsilon}_{avg} = \frac{\sum_{i=0}^{n} P_{sim_i} - \sum_{i=0}^{n} P_{est_i}}{\sum_{i=0}^{n} P_{est_i}}$$
(9)

Where  $P_{sim}$  and  $P_{est}$  are the simulated and estimated powers respectively.

The results demonstrated that the proposed GA can generate random patterns with high convergence and produce accurate statistical values. For the input pattern characterizations of *TD*, *SP*, *SC*, and *TC* we select the probabilistic range of [0, 1]. Our algorithm generated 740 different input streams with width of 8, 16 and 32-bits. The signal stream length is 4000 for macro-module. Analysis of convergence on power may help us to find the simulation interval, by determining when the power value is more close to the appropriate interval length in the steady state waveform. The sequence generator has high value of convergence and uniformity. The appropriate length is found 3650 for the test system. The warm up interval is close to 550 and the steady state interval is obtained 2050.

The power dissipation is observed of  $IP-M_{1a}$ ,  $IP-M_{1b}$ ,  $IP-M_{1c}$ , and  $IP-M_{1d}$ , modules with different input characteristic values. The result demonstrates that among other characteristics, the transition activity TD is more important for the power consumption. The power and TD are considerable linear with each other. The input correlation factors TD and SC are not very effective with the power and not very sensitive as compared to TD. To fit the coefficient's in the model, regression method is obtained. For several modules, the coefficient of correlation is estimated about 94%.

The selected number of the input signals and the

estimate values error obtained with our macro-model function is shown in table 1 and Fig. 3. From Table I that f(.) in (6) is more accurate enough for estimating the average power consumption for IP-based test module. Columns two, three, four and five gives the minimum, maximum, average and rms-errors for the estimates used with our IP-based macro model. HSPICE have been used as power simulator in our power estimation procedure. In experiments, we computed the average minimum error 1.65%, average maximum error 3.49%, average error 2.70% and average rms error 5.93%. Our results demonstrated that our approach can be used to achieve accurate and fast power estimates in the early stage of digital system design. For the entire test system with interconnects/wires the error increases by 12-20%. This error can be improved by implementing different techniques that deal with the data path of interconnects/wires with IP macro-modules. The average error of the entire IP-based test system is 7.15%. The reference results of the system's power dissipation are determined using commercial power simulator. The table illustrates the function has less accuracy in some cases than others. We have found in a given specified range of statistical characteristic values between [0.3-0.7] is more accurate and comparatively less accurate between [0-0.3] and [0.7-1]. The experimental result shows that TD is more dominant factor for power dissipation in the test system and normally proportional to the power measurements. The correlation factors are not very effective to power dissipation and are significantly not more sensitive than the switching density. The number of input/output (I/O) is important to the (I/O) characteristics for the system.

TABLE I POWER ESTIMATION ACCURACY

| Test System   | Emin  | Emax  | Eavg  | Erms  |
|---------------|-------|-------|-------|-------|
| IP - M        | 1.32% | 3.58% | 2.16% | 6.54% |
| $IP - M_{1a}$ | 2.13% | 4.87% | 3.64% | 7.33% |
| $IP - M_{1b}$ | 2.45% | 5.56% | 4.43% | 8.43% |
| $IP - M_{1c}$ | 2.50% | 4.98% | 3.68% | 6.89% |
| $IP - M_{1d}$ | 2.76% | 4.90% | 4.61% | 8.38% |
| $IP - M_{2a}$ | 1.34% | 3.43% | 2.28% | 5.56% |
| $IP - M_{2b}$ | 0.60% | 1.45% | 1.19% | 4.32% |
| $IP - M_{2c}$ | 0.75% | 2.45% | 1.34% | 4.12% |
| $IP - M_{2d}$ | 1.45% | 4.32% | 2.56% | 5.78% |
| $IP - M_{3a}$ | 1.89% | 2.90% | 2.46% | 5.14% |
| $IP - M_{3b}$ | 1.08% | 1.89% | 1.42% | 3.97% |
| $IP - M_{3c}$ | 1.50% | 2.70% | 3.01% | 4.58% |
| $IP - M_{3d}$ | 1.76% | 2.32% | 2.42% | 6.15% |
| Average       | 1.6%  | 3.4%  | 2.70% | 5.93% |



Fig. 3. Error analysis of IP-modules in digital system

## VI. CONCLUSIONS

The presented work is an accurate simulationbased power analysis technique at the architectural level implemented on IP test system. In this work, for each IP module, we measure just 2.70% average and 5.93% rms errors respectively. But for whole test system with interconnects, the average error is found 7.15% and a correlation coefficient is observed around 94%. The results showed the accuracy has been improved in some cases than in others. Nowadays, we are using complex systems for evaluating this model and making it more accurate as compared to other models in this area.

#### REFERENCES

- S. Gupta, and F. Najm, "Power modeling for high-level power estimation," in *IEEE Trans. Very-Large-Scale Integrated System*, pp. 18-29, 8, (1), 2000.
- [ii] S. Gupta, and F. Najm, "Analytical models for RTL power estimation of combinational & sequential circuits," in *IEEE Trans. Comput. Aided Design. Integr. Circuits Systs.*, pp. 808-814, 19, (7), 2000.
- [iii] X. Liuu and M. Papaefthymiouu, "A-staticpower-estimation methodology-for-IP-baseddesign," in *Proc. IEEE Conf. on Design*, *Automation*, & *Test in Europe*, 2001.
- [iv] L. Macchiarulo, E. Macii, and M. Poncino,

"Wire placement for crosstalk energy minimization in address buses," in *Proc. Design, Automation and Test in Europe Conference and Exhibition*, 4-8, pp:158-162, March 2002.

- [v] S. Ravi, A. Raghunathan, and S. Chakfadhar, "Efficient RTL power estimation for large designs," in *Intl. Conf. on VLSI Design*, pp. 431-440, 2003.
- [vi] I. Lee, Y. Choi, Y. Cho, Y. Joo, H. Lim, H. G. Lee, H. Shim, and N. Chang, "Web-based energy exploration tool for embedded systems," in *IEEE Des. Test Comput.*, vol. 21, no. 6, pp. 572-586, Dec. 2004.
- [vii] P. Petrov, A. Orailoglu "Low-Power instruction Bus Encoding for Embedded Processors," in *IEEE Trans. VLSI Systems*, vol. 12, No. 8, pp. 812-826, Aug. 2004.
- [viii] N. Bansal, K. Lahiri, A. Raghunathan, and S. Chakradhar, "Power monitors: a framework for system-level power estimation using heterogeneous power models," in 18th Int. Conf. on VLSI Design, USA, pp. 579-585, 2005.
- [ix] W. Bircher and L. John, "Complete system power estimation: A trickledown approach based on performance events," in *Proc. ISPASS*, Apr. 2007, pp. 158168.
- [x] Y. Cho, Y. Kim, S. Park, and N. Chang, "Systemlevel power estimation using an on-chip bus performance monitoring unit," in *Proc. ICCAD*, , pp. 149154, Nov. 2008.
- [xi] Y. A. Durrani, and T. Riesgo, "Powerestimation-for-register transfer-level-bygenetic-algorithm," in Proc. for Intl. Conf. on Informatics-in-Control-Automation & Robotics, pp. 527-530, 2006.
- [xii] Y. A. Durrani, and T. Riesgo, "Architectural-Power-Analysis for-Intellectual-Property-Based-Digital-System," in *Journal of Low Power Electronics American Scientific*-*Publisher*, Vol.3, No. 3, pp. 271-279, (9), 2007.
- [xiii] Y. A. Durrani, and T. Riesgo, "Power-Estimation-Technique for-DSP-Architecture," in *Elsevier Journal of Digital Signal Processing*, Vol.19, Issue-2, pp.213-219, 2009.
- [xiv] Y. A. Durrani, "High-Level-Power-Optimization-for Array Multipliers," in *Journal* of the Nucleus, Vol.50, No.4, pp.351-358, 2013.